Joshua 4.0: Packing, PRO, and Paraphrases

Authors

  • Juri Ganitkevitch
  • Yuan Cao
  • Jonathan Weese
  • Matt Post
  • Chris Callison-Burch
Abstract

We present Joshua 4.0, the newest version of our open-source decoder for parsing-based statistical machine translation. The main contributions in this release are the introduction of a compact grammar representation based on packed tries, and the integration of our implementation of pairwise ranking optimization, J-PRO. We further present the extension of the Thrax SCFG grammar extractor to pivot-based extraction of syntactically informed sentential paraphrases.
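
As a rough illustration of the pivoting idea mentioned in the abstract, the sketch below scores phrase-level paraphrases by marginalizing over shared foreign translations, p(e2 | e1) = sum over f of p(f | e1) * p(e2 | f). It is a minimal sketch under stated assumptions, not Thrax's implementation: the function name `pivot_paraphrases`, the dictionary layout, and the toy probabilities are all hypothetical, and the syntactic (SCFG) side of the extraction described in the abstract is not modeled here.

```python
from collections import defaultdict


def pivot_paraphrases(p_f_given_e, p_e_given_f):
    """Score paraphrase candidates by pivoting through foreign phrases.

    p_f_given_e: {english phrase: {foreign phrase: p(f | e)}}
    p_e_given_f: {foreign phrase: {english phrase: p(e | f)}}
    Returns {e1: {e2: p(e2 | e1)}}, skipping the trivial case e2 == e1.
    Both tables are hypothetical inputs, not a Thrax or Joshua format.
    """
    table = defaultdict(lambda: defaultdict(float))
    for e1, foreign in p_f_given_e.items():
        for f, p_f in foreign.items():
            # Every English phrase that also translates f is a candidate.
            for e2, p_e2 in p_e_given_f.get(f, {}).items():
                if e2 != e1:
                    table[e1][e2] += p_f * p_e2
    return {e1: dict(cands) for e1, cands in table.items()}


# Toy, made-up probabilities for illustration only.
p_f_given_e = {"under control": {"unter kontrolle": 1.0}}
p_e_given_f = {"unter kontrolle": {"under control": 0.5, "in check": 0.5}}

paraphrases = pivot_paraphrases(p_f_given_e, p_e_given_f)
print(paraphrases)
```

In this toy setup, "in check" becomes a paraphrase candidate for "under control" because both phrases share the same pivot translation.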

Similar resources

External Plagiarism Detection based on Human Behaviors in Producing Paraphrases of Sentences in English and Persian Languages

With the advent of the internet and easy access to digital libraries, plagiarism has become a major issue. Applying search engines is one of the plagiarism detection techniques that converts plagiarism patterns to search queries. Generating suitable queries is the heart of this technique and existing methods suffer from lack of producing accurate queries, Precision and Speed of retrieved result...

Analysis of designed β-hairpin peptides: molecular conformation and packing in crystals.

The crystal structures of several designed peptide hairpins have been determined in order to establish features of molecular conformations and modes of aggregation in the crystals. Hairpin formation has been induced using a centrally positioned (D)Pro-Xxx segment (Xxx = (L)Pro, Aib, Ac6c, Ala; Aib = α-aminoisobutyric acid; Ac6c = 1-aminocyclohexane-1-carboxylic acid). Structures of the peptides...

A Comparison of Optimization Methods for Multi-objective Constrained Bin Packing Problems

Despite the existence of efficient solution methods for bin packing problems, in practice these seldom occur in such a pure form but feature instead various considerations such as pairwise conflicts or profits between items, or aiming for balanced loads amongst the bins. The Wedding Seating Problem is a combinatorial optimization problem incorporating elements of bin packing with conflicts, bin pack...

Computational Study of Packing a Collagen-Like

The lateral packing of a collagen-like molecule, CH3CO-(Gly-L-Pro-L-Pro)4-NHCH3, has been examined by energy minimization with the ECEPP/2 force field. Two current packing models, the Smith collagen microfibril twisted equilateral pentagonal model and the quasi-hexagonal packing model, have been extensively investigated. In treating the Smith microfibril model, energy minimization was carried out o...

Extracting Paraphrases from a Parallel Corpus

While paraphrasing is critical both for interpretation and generation of natural language, current systems use manual or semi-automatic methods to collect paraphrases. We present an unsupervised learning algorithm for identification of paraphrases from a corpus of multiple English translations of the same source text. Our approach yields phrasal and single word lexical paraphrases as well as sy...

Publication date: 2012